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Abstract 

Random  geometric  graphs  have  been  one  of  the  fundamental  models  for  reasoning  about  wireless 
networks:  one  places  n  points  at  random  in  a  region  of  the  plane  (typically  a  square  or  circle),  and  then 
connects  pairs  of  points  by  an  edge  if  they  are  within  a  fixed  distance  of  one  another.  In  addition  to  giving 
rise  to  a  range  of  basic  theoretical  questions,  this  class  of  random  graphs  has  been  a  central  analytical 
tool  in  the  wireless  networking  community. 

For  many  of  the  primary  applications  of  wireless  networks,  however,  the  underlying  environment 
has  a  large  number  of  obstacles,  and  communication  can  only  take  place  among  nodes  when  they  are 
close  in  space  and  when  they  have  line-of-sight  access  to  one  another  —  consider,  for  example,  urban 
settings  or  large  indoor  environments.  In  such  domains,  the  standard  model  of  random  geometric  graphs 
is  not  a  good  approximation  of  the  true  constraints,  since  it  is  not  designed  to  capture  the  line-of-sight 
restrictions. 

Here  we  propose  a  random-graph  model  incorporating  both  range  limitations  and  line-of-sight  con¬ 
straints,  and  we  prove  asymptotically  tight  results  for  /e-connectivity.  Specifically,  we  consider  points 
placed  randomly  on  a  grid  (or  torus),  such  that  each  node  can  see  up  to  a  fixed  distance  along  the  row 
and  column  it  belongs  to.  (We  think  of  the  rows  and  columns  as  “streets”  and  “avenues”  among  a  reg¬ 
ularly  spaced  array  of  obstructions.)  Further,  we  show  that  when  the  probability  of  node  placement  is  a 
constant  factor  larger  than  the  threshold  for  connectivity,  near-shortest  paths  between  pairs  of  nodes  can 
be  found,  with  high  probability,  by  an  algorithm  using  only  local  information.  In  addition  to  analyzing 
connectivity  and  ^-connectivity,  we  also  study  the  emergence  of  a  giant  component,  as  well  an  approx¬ 
imation  question,  in  which  we  seek  to  connect  a  set  of  given  nodes  in  such  an  environment  by  adding  a 
small  set  of  additional  “relay”  nodes. 
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1  Introduction 


Most  of  today’s  approaches  to  wireless  computing  and  communications  arc  built  on  architectures  where  base 
stations  connect  the  wireless  devices  to  a  supporting  infrastructure.  However,  since  the  overwhelming  trend 
is  to  transmit  information  in  packets,  over  standard  protocols,  a  dominant  focus  in  the  wireless  research 
community  is  on  more  decentralized  approaches  where  nodes  cooperate  to  relay  packets  on  behalf  of  other 
nodes.  This  focus  is  at  the  heart  of  current  work  on  mobile  ad  hoc  networks  (MANETs)  [17,  18]. 

Such  networks  can  be  viewed  as  consisting  of  a  collection  of  nodes,  representing  wireless  devices, 
positioned  at  various  points  in  some  physical  region.  The  (wireless)  “links”  of  the  network,  joining  pairs 
of  nodes  that  can  directly  communicate  with  one  another,  are  predominantly  short-range  and  constrained 
by  line-of-sight;  this  is  an  inevitable  result  of  the  scarcity  of  radio  frequency  (RF)  spectrum  and  physical 
constraints  on  the  propagation  of  RF  and  optical  signals.  The  ways  in  which  these  physical  limits  on  direct 
communication  affect  the  overall  performance  of  the  network  is  a  fundamental  issue  that  motivates  much  of 
the  theoretical  work  in  this  area. 

Random  Geometric  Graphs.  Given  this  framework,  random  geometric  graphs  have  emerged  as  a  dom¬ 
inant  model  for  theoretical  analysis  of  distributed  wireless  networks.  One  places  n  points  uniformly  at 
random  in  a  geometric  region  (typically  a  disc  or  a  square),  and  then,  for  a  range  parameter  r,  one  connects 
each  pair  of  nodes  that  arc  within  distance  r  of  one  another.  This  model  is  the  subject  of  a  book  by  Penrose 
[20],  and  we  refer  the  reader  there  for  extensive  background;  we  also  note  that  the  enormously  influential 
work  of  Gupta  and  Kumar  on  the  capacity  of  wireless  networks  is  framed  in  this  model  as  well  [13,  14]. 

One  of  the  most  basic  questions  is  to  determine  how  the  probability  of  connectivity  of  a  random  geo¬ 
metric  graph  depends  on  the  number  of  nodes  n  and  the  range  parameter  r.  A  canonical  result  here  is  the 
following  theorem  of  Penrose  [19].  If  we  place  n  points  uniformly  at  random  in  a  unit  square,  and  then 
continuously  increase  the  range  parameter  r,  with  high  probability  the  resulting  geometric  graph  becomes 
^-connected  at  the  smallest  value  of  r  for  which  there  are  no  nodes  of  degree  <  k.  In  other  words,  the 
graph  becomes  /;: -connected  at  the  moment  that  all  trivial  obstacles  to  ^-connectivity  (i.e.  low-degree  nodes) 
disappear.  An  analogous  type  of  result  is  familial-  from  classical  Erdos-Renyi  random  graph  models  [4]. 
(For  further  results  and  discussion  on  thresholds  in  random  geometic  graphs,  see  Goel  et  al.  [11].) 

For  modeling  distributed  wireless  networks,  the  assumption  of  random  node  placement  has  proved  to 
be  a  reasonable  abstraction  for  the  lack  of  structure  in  node  locations,  given  that  most  frameworks  for  ad 
hoc  networks  assume  some  arbitrary  initial  “scattering”  of  nodes,  or  that  nodes  reach  their  positions  as  a 
result  of  arbitrary  mobility.  More  problematic  is  the  fact  that  the  analysis  takes  place  in  regions  with  no 
obstructions  —  in  other  words,  that  a  node  can  communicate  with  all  other  nodes  within  distance  r.  This  is 
at  odds  with  the  underlying  constraints  in  many  applications  of  distributed  wireless  networks,  where  there 
can  generally  be  a  large  number  of  obstructions  limiting  communication  between  nearby  nodes  due  to  a  lack 
of  direct  line-of-sight  contact. 

In  other  words,  while  random  geometric  graphs  model  wireless  networks  in  open  spaces,  we  lack  a 
corresponding  model  for  wireless  networks  in  some  of  their  most  common  domains:  urban  settings,  large 
indoor  environments,  or  any  other  context  in  which  there  are  obstacles  limiting  visibility.  With  such  a  model 
would  come  the  ability  to  address  a  range  of  basic  theoretical  problems.  In  particular,  we  are  guided  by  the 
following  genre  of  question: 

How  do  connectivity  and  other  structural  properties  of  random  geometric  graphs  change  once 

we  introduce  line-of-sight  constraints? 

An  understanding  of  such  issues  could  help  provide  a  framework  for  reasoning  more  generally  about  the 
performance  of  distributed  wireless  networks  in  obstructed  environments. 
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The  present  work:  Connectivity  in  line-of-sight  networks.  In  this  paper,  we  propose  a  random-graph 
model  incorporating  both  range  limitations  and  line-of-sight  constraints,  and  we  prove  asymptotically  tight 
results  for  /c-connectivity.  We  also  consider  related  structural  questions,  including  the  emergence  of  a  giant 
component,  as  well  as  some  of  the  algorithmic  issues  raised  by  the  model. 

To  motivate  the  model,  consider  a  stylized  abstraction  of  limited-range  wireless  communication  in  an 
urban  environment:  there  arc  n  streets  running  east-west,  n  avenues  running  north-south,  and  wireless  nodes 
can  be  placed  at  intersections  of  streets  and  avenues.  Each  node  has  range  ui  —  it  can  see  up  to  uj  blocks 
north  and  south  along  the  avenue  it  lies  on,  and  up  to  u  blocks  east  and  west  along  the  street  it  lies  on. 

More  concretely,  we  have  an  underlying  set  T  of  lattice  points  {(x,y)  :  x,y  €  {1,2, .. .  ,n}}.  We 
measure  distance  using  the  L  \  metric,  though  to  prevent  complications  arising  from  boundary  effects  in  this 
presentation,  we  define  the  distance  between  points  as  though  they  form  a  torus: 

d((x,  y),  (xr ,  y'))  =  min(|x  —  x'\,n  —  \x  —  x'\)  +  min(|y  —  y'\,n  —  \y  —  y'\ ). 


For  a  specified  range  parameter  uj,  we  say  that  two  points  are  mutually  visible  if  they  are  in  the  same  row  or 
the  same  column  of  the  torus,  and  if  they  are  within  distance  at  most  a j  from  one  another.  We  view  the  range 
u i  as  implicitly  being  a  function  of  n,  and  in  this  paper  we  will  make  the  assumption  that  to  is  asymptotically 
bounded  below  by  In  n  and  above  by  some  polynomial  in  n;  specifically,  we  assume  In  n  =  o(ut)  and  that 
0 o  =  0(ns )  for  a  value  of  5  <  1  to  be  specified  below. 

We  now  study  the  random  graph  G  that  results  if,  for  some  placement  probability  p  >  0,  we  locate  a 
node  at  each  point  of  T  independently  with  probability  p,  and  then  connect  those  pairs  of  nodes  that  arc 
mutually  visible.  As  p  increases,  the  torus  becomes  more  crowded  with  nodes,  and  the  resulting  graph  G 
is  more  likely  to  be  connected.  Our  main  result  states,  roughly,  that  the  smallest  value  of  p  at  which  G 
becomes  fc-connected  with  high  probability  is  asymptotically  the  same  as  the  smallest  value  of  p  at  which 
the  minimum  degree  in  G  is  k  with  high  probability. 

More  concretely,  for  a  critical  value  of  the  placement  probability  p*  =  0(— ),  we  find  that  in  an 
interval  of  width  O ( )  around  p*,  the  random  graph  G  goes  from  being  /r-conncctcd  with  arbitrarily  small 
probability  to  being  /r-conncctcd  with  probability  arbitrarily  close  to  1.  Moreover,  the  probability  that  G  has 
no  nodes  of  degree  <  k  undergoes  a  comparable  transition  in  a  corresponding  interval  around  p*.  We  state 
this  theorem  about  fc-connectivity  as  follows.  First,  we  write  uj  =  ns  where  we  assume  that  uj  A>  hi  n  and 
5  <  yjp-j-  Note  that  we  do  not  preclude  the  case  where  5  =  o(l). 


Theorem  1.1  Let  k  >  1  be  a  fixed  positive  integer  and  let  p 


(1—  |<5)  In  71+ In  In  n+c„ 
2  to 


.  Then 


lim  Pr(G  is  k-connected ) 

n— xx) 


0  cn  — >  — oo 
<  e~Xk  cn  — *  c 
1  cn  *  oo 


where 

_  2fc-2(l  -  k5)ke~2c 
Xk=  • 

The  proof  of  this  result,  which  occupies  Section  2  of  the  paper,  requires  techniques  quite  different  from 
the  analysis  of  standard  geometric  random  graphs,  due  to  the  line-of-sight  constraints.  One  way  to  appreciate 
why  this  appeal's  necessary  is  to  consider  that,  as  we  vary  uj,  the  resulting  model  interpolates  between  two 
well-known,  but  qualitatively  different  random  graph  models.  When  u  =  1,  so  that  a  node  can  only  see 
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neighboring  points,  we  have  site  percolation  on  a  lattice,  a  well-studied  problem  that  is  still  not  completely 
well  understood.  At  the  other  extreme,  when  oj  =  n  and  nodes  can  see  all  points  in  their  row  and  column,  it 
is  easy  to  see  that  the  model  is  equivalent  to  a  purely  graph-theoretic  one  in  which  we  start  with  the  complete 
bipartite  graph  Kn,n  and  keep  each  edge  with  probability  p.  Note  that  our  bounds  on  oj  preclude  either  of 
these  exact  extremes,  but  our  analysis  for  the  “middle  region”  of  oj  that  we  consider  involves  ingredients 
from  both  extremes,  combining  techniques  from  “classical”  random  graph  analysis  with  the  combinatorics 
of  the  underlying  grid  of  points. 

The  present  work:  Further  results.  We  consider  the  emergence  of  a  giant  component  in  our  model.  We 
prove  that  if  p  =  c/oj  for  c  >  1  and  oj  — >  oo,  then  with  high  probability  G  contains  a  component  with  a 
lineal-  fraction  of  all  the  nodes. 

We  also  consider  the  problem  of  how  nodes  in  such  a  random  graph  can  construct  paths  between  each 
other,  possessing  knowledge  of  their  own  coordinates  but  otherwise  having  only  local  information.  We 
show  that  when  p  exceeds  the  threshold  for  connectivity  by  a  fixed  (relatively  small)  constant  factor  — 
i.e.  p  =  C  In  n /oj  —  then  a  simple  decentralized  algorithm  allows  a  given  pair  of  nodes  at  Li-distance  d 
to  construct,  with  high  probability,  a  path  of  O{d/oj  +  Inn)  edges  while  involving  only  O(d/oj  +  win n) 
nodes  in  the  computation.  This  is  nearly  optimal,  even  with  global  information,  since  il(d/uj)  is  a  simple 
lower  bound  on  the  length  of  any  path  between  nodes  at  Li-distance  d  (and  hence  also  a  lower  bound  on  the 
number  of  nodes  who  need  to  participate  in  the  construction  of  the  path). 

Finally,  we  consider  a  basic  algorithmic  problem  in  a  non-random  version  of  the  line-of-sight  model: 
given  an  input  set  of  nodes,  we  would  like  to  add  a  small  set  of  additional  nodes  so  that  the  full  set  becomes 
connected.  More  concretely,  suppose  we  are  given  a  set  of  nodes  at  points  X  C  T,  such  that  the  graph  on  X 
(defined  by  visibility  with  respect  to  the  range  parameter  oj)  is  not  connected.  We  would  like  to  add  further 
nodes,  at  a  set  Y  C  T,  where  Y  should  be  as  small  as  possible  subject  to  the  constraint  that  the  graph  on 
IU  Y  should  be  connected.  We  think  of  the  additional  nodes  Y  as  “relays”  that  connect  the  original  nodes 
in  X  under  line-of-sight  constraints;  as  a  result,  we  refer  to  this  as  the  Relay  Placement  problem. 

By  considering  the  graph  of  mutual  visibility,  and  viewing  the  nodes  in  Y  as  Steiner  nodes,  an  instance 
of  Relay  Placement  can  be  easily  cast  as  an  instance  of  the  Node-Weighted  Steiner  Tree  problem.  The 
general  Node -Weighted  Steiner  Tree  problem  is  inapproximable  to  within  a  factor  of  Of  log  n)  [16].  For  the 
class  of  line-of-sight  networks  that  we  study  here,  however,  we  show  how  to  exploit  the  underlying  visibility 
structure  to  obtain  a  constant-factor  approximation.  In  particular,  we  make  use  of  a  graph-theoretic  notion 
that  we  call  cohesiveness,  which  suggests  some  combinatorial  questions  of  independent  interest. 

Relay  Placement  is  clearly  related  to  certain  algorithmic  art-gallety  problems  (see  e.g.  [8,  9]  and  the 
VC-dimension  results  in  [15,  23]),  since  there  too  one  is  placing  nodes  in  a  region  subject  to  visibility 
constraints.  However,  the  problems  considered  in  the  literature  on  art-gallery  problems  have  a  different 
focus,  as  they  are  concerned  with  placing  nodes  so  as  to  see  the  entire  region,  as  opposed  to  adding  Steiner 
nodes  so  as  to  create  a  connected  visibility  graph,  as  we  do  here. 

2  Connectivity 

This  section  is  devoted  to  the  proof  of  Theorem  1.1.  We  will  concentrate  first  on  the  case  where  cn  — >  c  and 
to  avoid  trivialities  we  will  assume  that  cn  =  c.  Thus  until  further  notice,  we  will  assume  that 

(1  —  i<5)  In  n  +  4  In  In  n  +  c 
P= - - - 2oj - ' 
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The  overall  outline  of  the  proof  is  as  follows.  We  imagine  adding  nodes  in  two  stages  —  most  of  the 
nodes  in  the  first  stage,  and  a  few  final  nodes  in  the  second  stage.  Now,  suppose  the  graph  H  formed  by 
nodes  added  in  the  first  stage  can  be  disconnected  by  the  deletion  of  some  set  S  of  fewer  than  k  nodes.  We 
argue  that  with  high  probability,  any  two  components  J  and  K  of  H  —  S  come  “close”  to  one  another  at 
many  disjoint  locations  on  the  torus  T  —  in  particular,  at  each  of  these  locations,  there  is  some  point  of  the 
torus  that  sees  nodes  in  both  J  and  K.  When  we  then  add  nodes  in  the  second  stage,  it  is  enough  that  a 
node  is  placed  at  one  of  these  points  that  can  see  both  components;  and  we  argue  that  there  arc  enough  such 
points  that  this  happens  with  high  probability. 


2.1  Minimum  degree  computation 

We  first  show,  by  analogy  with  the  random  graph  Gn<p,  that  the  threshold  for  connectivity  coincides  with 
the  threshold  for  there  being  no  vertices  of  degree  less  than  k.  The  proof  is  given  in  the  appendix. 

Proposition  2.1  lim  Pr(6'  contains  a  vertex  of  degree  <  k)  =  1  —  e-Afc. 

n— >oo 


2.2  Probabilistic  part  of  proof 

We  imagine  placing  nodes  at  random  according  to  the  following  two-stage  process.  We  place  a  node  at  each 
point  with  probability  p\  in  the  first  stage.  We  then  independently  place  a  node  at  each  point  with  probability 
p-2  in  the  second  stage.  We  choose 

(1  —  i<5)  In  n  +  4  In  In  n  +  c  —  (In  n)~l  In  n 

p  i  = - - - - - >  - 

1  2a ;  “3a; 

and  p2  so  that  this  is  equivalent  to  the  original  placement  process  with  probability  p,  in  which  case 

1 

P2  ~  -x— ; - • 

2a;  in  n 

For  ease  of  terminology,  we  say  that  a  node  is  red  if  it  was  placed  in  the  first  stage,  and  we  say  that  it  is 
blue  if  it  is  placed  in  the  second  stage  at  a  point  not  hit  by  the  first  stage.  Let  H  denote  the  subgraph  of  G 
consisting  only  of  red  nodes. 

For  each  point  in  T,  we  define  its  four  arms  to  be  the  four  sets  of  uj  points  that  are  visible  from  it 
in  a  single  direction  (north,  south,  east,  and  west).  We  further  partition  each  arm  a  of  point  x  into  10 
consecutive  segments  a\,  a2,  ■  ■  ■ ,  aio  of  length  u/ 10.  A  segment  is  said  to  be  weak,  otherwise  strong,  if  it 
contains  fewer  than  ^  In  n  red  nodes.  An  arm  is  said  to  be  mighty  if  all  its  segments  are  strong.  We  first 
claim 


Lemma  2.2  With  high  probability  there  does  not  exist  a  red  node  which  has  an  arm  a  on  which  we  can  find 
1000  red  vertices,  each  having  an  arm  orthogonal  to  a  which  is  not  mighty. 

Proof  For  a  fixed  point  x  and  arm  a,  the  probability  that  the  arm  contains  a  weak  segment  can  be  bounded 
by 

Bin(u>/10,pi)  <  —  lnra^j  <  e^(llin)/400  =  n-i/400. 

50  J 

So  the  probability  that  there  is  a  red  node  as  described  in  the  statement  is  bounded  by 

Vr0--10007400  = o(i).  ■ 
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Lemma  2.3  With  high  probability  H  does  not  contain  a  vertex  v  of  degree  less  than  In  In  n  that  has  a 
neighbor  w  such  that  w  contains  an  arm  orthogonal  to  vw  which  is  not  mighty. 


Proof.  The  probability  that  H  contains  such  a  pair  v,  w  is  bounded  by 


In  In  n  / 

n2pi  ( 

4aA  t 

t  h 

t= 1  ' 

In  In  n  , 

<  2n-1/400 

E 

t= 1  ' 

=  0(1) 

■. 

(4  +  o(l))elnn^  e_2c+o(i) 


Lemma  2.4  With  high  probability  H  does  not  contain  a  red  vertex  with  at  most  k  —  1  red  neighbors  and  at 
least  one  blue  neighbour. 

Proof.  The  probability  that  H  contains  such  a  vertex  v  is  bounded  by 

k — 1  /  t  \ 

n2pi  x;  (  ^  W(1  -pi)4w_t(4wp2)  ~  4Afeu;p2  =  o(l).  ■ 

t= o  '  ' 


Lemma  2.5  With  high  probability  H  does  not  contain  a  blue  vertex  with  fewer  than  k  red  neighbours. 
Proof.  The  probability  that  H  contains  such  a  vertex  v  is  bounded  by 

n2p,  £  ~  —  =  0(1).  ■ 

Let  £],  £ 2 ,  £3,  and  £4  denote  the  events  that  the  properties  in  Lemmas  2.2,  2.3,  2.4,  and  2.5  respectively 
hold. 


2.3  Non-probabilistic  part  of  proof 

For  this  next  paid,  we  assume  that  the  high-probability  events  considered  thus  far  all  occur;  in  particular,  we 
assume  that  5(G)  >  k  and  that  £\,  £2,  £3,  and  £4  all  hold. 

Recall  that  H  is  the  subgraph  of  G  consisting  only  of  the  red  nodes.  Let  S  be  an  arbitrary  set  of  k  —  1 
red  vertices,  and  let  Hs  =  H  —  S.  Our  goal  is  to  show  that  if  Hs  has  multiple  connected  components,  then 
with  high  probability  they  will  all  be  linked  up  by  the  addition  of  the  blue  nodes. 

Let  L  be  the  set  of  points  in  T  with  coordinates  (i,j),  where  each  of  i  and  j  is  a  multiple  of  3c 0.  For 
each  connected  component  K  of  Hs,  and  for  each  point  x  G  L,  let  vkx  denote  the  node  in  K  that  is  closest 
to  x  in  L\  distance.  We  claim 

Lemma  2.6  vkx  kes  within  the  ut  x  lo  box  Bx  centered  at  x. 
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Proof.  Let  a  red  node  be  pink  if  it  is  not  in  S.  Assume  without  loss  of  generality  that  the  point  x  is  located 
at  the  origin  of  the  torus,  which  we  denote  x  =  0.  Suppose  that  v  =  vko  =  (a,  b)  is  N-E  of  0  and  that  it 
does  not  lie  in  Bq.  v  has  at  least  one  arm  containing  a  pink  node  w.  This  follows  from  the  occurrence  of  £3. 
If  the  degree  of  v  is  less  than  In  In  n  then  we  can  use  the  non-occurrence  of  £2  to  argue  that  the  two  arms  of 
w  orthogonal  to  vw  are  mighty.  If  the  degree  of  v  is  greater  than  In  In  n  then  we  can  use  the  non-occurrence 
of  £\  to  argue  that  there  is  a  choice  of  In  In  n  —  4000  w’ s  such  that  the  two  arms  of  w  orthogonal  to  vw  are 
mighty.  Let  a  denote  the  arm  of  v  containing  a  w  with  mighty  arms.  Note  that  every  segment  of  a  mighty 
arm  contains  at  least  ^  In  n  pink  nodes. 

Case  1:  cr  is  the  South  arm  of  v. 

If  a  <  cu/2  then  any  pink  node  on  a  is  either  in  Bq  or  closer  to  0  than  vko-  Similarly,  if  b  >  lo/2  then 
any  pink  node  on  a  is  closer  to  0  than  vko-  So  we  can  assume  that  a  >  lo/2  >  b.  Also,  if  (a,  b ')  G  a  then 
we  must  have  0  >  b'  =  —  b"  where  we  can  assume  that  b  <  b"  <  10  —  b. 

Choose  such  a  pink  node  (a,  —b")  with  a  mighty  West  arm  (3 .  Now  choose  a  pink  node  w  =  ( a —b”)  G 
ft  such  that  (i)  a  —  o'  G  [.4tu,  .5cu]  and  (ii)  the  North  arm  7  of  w  is  mighty.  Now  choose  a  pink  node 
(a',  c)  G  7  such  that  \c  —  b\  <  .leu.  It  follows  that  |a'|  +  |c|  <  a  +  b  +  .leu  —  .4eu,  contradiction. 

Case  2a:  a  is  the  North  arm  of  v  and  a  >  lo/2. 

Choose  a  pink  node  (a,  b ')  G  a  with  a  mighty  West  ai  m  6.  Then  choose  a  pink  node  w  =  (a',  b')  G  ft 
such  that  (i)  a  —  a'  G  [.4eu,  .5eu]  and  (ii)  the  South  arm  7  of  w  is  mighty.  Now  choose  a  pink  node  ia'i  b")  G  7 
such  that  | b"  —  b\  <  .leu.  It  follows  that  |a'|  +  \b"\  <  a  +  b  +  .leu  —  ,4eu,  contradiction. 

Case  2b:  a  is  the  North  ai  m  of  v  and  a  <  lo/2. 

We  must  have  b  >  lo/2,  else  vko  £  Bq.  Choose  a  pink  node  (a,  //  )  G  a  with  a  mighty  West  arm  Q. 
Then  choose  a  pink  node  w  =  ( a' ,  b')  G  /?  such  that  (i)  |a  —  a'\  <  .leu  and  (ii)  the  South  arm  7  of  w  is 
mighty. 

If  |6  —  b' |  <  .7 lo  then  choose  a  pink  node  ( a',b ")  G  7  such  that  | b"  —  b\  G  [.9eu,eu].  It  follows 
that  |a'|  +  | b"\  <  a  +  b  +  .leu  +  .7eu  —  .9eu,  contradiction.  Otherwise,  | b  —  b'\  >  .7 lo.  We  can  choose 
a  pink  node  y  =  (. a',b ")  G  7  such  that  the  West  arm  <)  of  y  is  mighty  and  | b'  —  b"\  >  .9 lo.  Choose 
a  pink  node  z  =  {a" ,b")  G  5  such  that  \a"  —  a'\  <  .leu  and  its  South  arm  e  is  mighty.  Finally,  we 
note  that  there  exists  a  pink  node  w  =  (a" .  b"r)  G  e  such  that  \b"  —  b”'  G  [.5eu,  .6eu].  Then  we  have 
|  a"  |  +  \b’"\  <a  +  6  +  eu-|-.leu  —  .9eu  +  .leu  —  .5eu,  contradiction. 

The  case  where  a  is  the  West  arm  is  dealt  with  as  in  Case  1  and  the  case  where  a  is  the  east  arm  is  dealt 
with  as  in  Case  2.  ■ 

Now,  let  J  and  K  be  two  distinct  component  of  Hg.  Since  vjx  and  vkx  both  lie  in  the  eu  x  eu  box  around 
x ,  there  is  some  point  z(J,K ,  x)  that  is  visible  from  both  of  them.  We  observe  that 

Lemma  2.7  The  points  z(J,  K,  x)  and  z(J,  K.  y)  are  distinct,  for  distinct  points  x,y  G  L. 

Proof.  z(  J.  I\,  x)  lies  in  the  cu  x  lo  box  around  x,  and  z(J,  K,  y)  lies  in  the  cu  x  lo  box  around  y,  and  these 
boxes  arc  disjoint,  since  x  and  y  arc  at  least  3cu  apart.  ■ 

2.4  Finishing  the  proof 

Note  that  if  a  node  is  placed  at  z(J,  K,  x),  then  it  will  be  a  neighbor  both  of  a  point  in  J  and  K ,  and  hence 

J  and  K  will  belong  to  the  same  component  in  G.  In  the  second  stage  of  node  placement,  a  blue  node  will 

2 

be  placed  at  each  point  z(J ,  K,  x)  with  probability  77.  By  Lemma  2.7 ,  there  arc  f-jj  such  points  for  a  fixed 
pair  of  components  J,  K,  and  so  the  probability  that  no  blue  point  is  placed  at  any  of  them  is  bounded  by 

(1  —  p2)n2/(9aj2)  <  e-™2/(20a;3lnn)  <  g-n2-3V( 20 Inn) 
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There  arc  at  most  cu2  components,  since  for  any  fixed  point  x  €  L,  each  component  has  a  node  in  the  wxw 
box  around  x. 

Thus,  the  probability  that  there  exists  a  set  S  of  size  at  most  k  —  1  and  components  J,  K  of  H$,  which 
are  not  connected  in  G  by  a  blue  vertex  is  at  most  uj4e~n  /(20inn)n2fc-2  _  0(T)_  Thus,  conditional  on 
there  being  no  vertices  of  degree  /;:  —  1  or  less,  if  we  remove  any  set  S  of  k  —  1  vertices,  then  with  high 
probability  the  graph  H$  has  a  component  containing  all  of  the  red  vertices.  It  follows  from  £4  that  G  —  S 
is  connected  and  so  G  itself  is  ^-connected  with  high  probability. 

This  finishes  the  case  cn  — >  c.  If  cn  — >  —  00  then  one  uses  the  Chebyshev  inequality  to  show  that  with 
high  probability  there  are  vertices  of  degree  less  than  k.  If  cn  — >  00  then  with  high  probability  there  are  no 
vertices  of  degree  less  than  k  (the  expected  number  tends  to  zero),  and  the  argument  for  cn  — >  c  implies  that 
G  will  be  /;; -connected  with  high  probability. 

This  completes  the  proof  of  Theorem  1.1. 


3  The  Existence  of  a  Giant  Component 

We  now  consider  the  existence  of  a  giant  component  in  our  model  of  line-of-sight  networks.  Note  here  that 
since  G  itself  has  O (n2p)  vertices,  a  giant  component  is  one  with  (.i(n2p)  vertices. 

Theorem  3.1 

(a)  If  p  =  where  c  >  1  and  uj  — >  00  then  with  high  probability  G  contains  a  component  with  (1  — 

o(l))(l  —  x2c)n2  /uj  vertices,  where  xc  is  the  unique  solution  in  (0, 1)  ofxe~x  =  ce~c. 

(b)  Ifp  =  where  c  <  l/(4e)  and  uj  —>  00  then  with  high  probability  the  largest  component  in  G  has  size 

O(lnn). 

To  prove  paid  (a)  of  the  theorem,  we  first  require  a  lemma  about  the  existence  of  a  giant  component  in 
the  random  graph  H  =  llm  ,m.q  where  q  =  d/m.  Here  we  create  H  by  including  each  edge  of  the  complete 
bipartite  graph  Krn  rn  independently  with  probability  q. 

Lemma  3.2  If  d  >  1  then  with  high  probability  H  contains  a  component  Cg  with  (1  —  o(l))(l  —  xf)m 
vertices  on  each  side  of  the  partition,  where  xj  is  the  unique  solution  in  (0, 1)  ofxe~x  =  de~d.  Furthermore 
Cg  contains  (1  —  o(l))(l  —  x/frn  edges. 

Proof.  We  follow  the  proof  of  the  existence  of  a  giant  component  via  branching  processes  as  elaborated  in 
Chapters  10.4  and  10.5  of  Alon  and  Spencer  [1],  Note  that  the  degree  of  a  vertex  of  H  has  a  distribution 
which  is  asymptotically  Poisson  with  mean  d  and  the  proof  in  [1]  can  easily  be  adapted  to  H.  This  will  show 
that  Cg  has  ~  (1  —  xf)m  vertices  on  each  side.  To  get  the  number  of  edges,  imagine  the  model  where  we  fix 
the  number  of  edges  as  p  ~  dm.  Suppose  now  we  put  in  //  —  1  random  edges  and  obtain  a  giant  component 
C'g  with  (1  —  o(l)) (1  —  xf)m  vertices  on  each  side.  Now  put  in  the  //th  random  edge.  The  probability  it  is 
not  paid  of  the  giant  component  Cg  is  ~  xd ■  This  shows  \E(Cg)\  ~  (1  —  x2f)m  in  expectation.  By  adding 
two  random  edges  we  can  estimate  the  variance  and  then  use  Chebyshev ’s  inequality.  ■ 

Now  divide  the  torus  T  into  N  =  n2 /u2  sub-squares  Si,  S2> _ ,  Sjv  of  size  uj  x  u.  Fix  a  particular  sub¬ 

square  Sj  and  consider  the  bipartite  graph  Hi  with  lo  +  uj  vertices  II,  U  C,  (rows/columns)  where  there  is  an 
edge  (x,  y )  G  R,  x  Ct  if  the  gridpoint  of  T  corresponding  to  (x.  y )  is  occupied  by  a  node  of  G.  Applying 
Lemma  3.2  with  m  =  uj  and  d  =  c  we  see  that  with  probability  (1  —  o(l)),  H ,  contains  a  giant  component 
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Figure  1 :  The  sub-squares  used  in  the  analysis  of  the  giant  component. 


T,:  with  (1  —  o(l))(l  —  xc)u ;  vertices  on  each  side  and  (1  —  o(l))(l  —  x2)u2  edges.  When  translated  into  a 
subgraph  of  G,  we  see  that  Hi  induces  a  subgraph  G,  with  (1  —  o(l))(l  —  x2)lu2  vertices.  This  is  because 
each  edge  of  H%  corresponds  to  a  vertex  of  G. 

We  divide  each  sub-square  S,  further  into  16  u/ 4  xw/4  sub-squares.  We  choose  4  special  sub-squares 
Sip, . . . ,  Sip.  These  will  either  be  at  (1,  2),  (2, 1),  (3, 4),  (4,  3)  or  at  (1, 3),  (2,4),  (3, 1),  (4,  2)  where  (i,j) 
denotes  the  sub-square  in  row  i,  column  j,  1  <  i.j  <  4.  We  then  have  these  two  sorts  of  sub-square 
alternate  along  the  rows  and  columns  of  T  as  in  Figure  1. 

Each  special  sub-square  is  associated  with  a  direction.  I f  i  =  1  then  the  direction  is  North.  If  i  =  4  then 
the  direction  is  South.  If  j  =  1  then  the  direction  is  West  and  if  j  =4  then  the  direction  is  East. 

Now  with  high  probability  each  of  the  4  special  sub-squares  will  contain  ~  (1  —  xc)uj/4  useable  columns 
(North  or  South  sub-squares)  or  rows  (East  or  West  sub-squares)  that  correspond  to  vertices  of  a  giant 
component  of  the  corresponding  //,.  We  say  that  a  square  Si  is  good  if  Hi  contains  a  giant  component  with 
~  (1  —  x2)lo2  edges  and  each  special  sub-square  has  ~  (1  —  xc)uj/A  useable  rows  or  columns,  depending 
on  its  direction. 

If  Hi  is  good  then  we  choose  (1  —  xc)oj/h  random  rows  or  columns  from  the  useable  rows  or  columns 
of  each  the  four  special  sub-squares.  Let  Xij  be  the  set  of  rows  or  columns  chosen  from  ,5)  j .  We  observe 
that  conditional  on  Sl  being  good,  the  sets  Xr  ]  arc  uniformly  random  and  independent  of  each  other. 

We  arc  now  in  a  position  to  use  mixed  percolation.  Let  C  denote  the  n/uj  x  n/uj  lattice  £  with  site 
percolation  py  =  1  —  o(l)  and  bond  percolation  py  =  1  —  o(l).  Here  we  place  a  vertex  at  site  i  is  the  square 
Si  is  good.  If  two  adjacent  sites  Hi,  Hl+\  say  are  good  then  we  join  them  by  an  edge  in  the  lattice  if  the 
following  holds:  Let  the  adjacent  special  squares  be  Sl>r  and  ,S)+i:.s.  We  add  the  edge  if  Xl/r  n  Xj+i)S  /  0. 
If  this  occurs  then  there  arc  a  pair  of  nodes  of  G,  u  €  T,,  v  G  F,+ 1  such  that  u,  v  arc  in  the  same  row  or 
column  and  arc  at  distance  <  oj  apart.  Hence  F,  and  F,+i  will  form  paid  of  the  same  component  in  G. 

In  this  model  of  percolation  the  giant  cluster  will  contain  almost  all  of  the  points;  for  example  this 
follows  from  a  simple  generalisation  of  Theorem  1.1  of  Deuschel  and  Pisztora  [7],  In  which  case  almost  all 
of  the  giants  F,  will  paid  of  the  same  component  of  G.  This  completes  the  proof  of  paid  (a)  of  Theorem  3.1. 

To  prove  paid  (b)  of  the  theorem,  we  first  note  that  an  r-rcgular,  Ar-vertcx  graph  contains  <  N(er)1'  1 
trees  with  k  vertices.  This  is  proved  for  example  in  Claim  1  of  [10].  Thus  the  expected  number  of  /.--vertex 
trees  in  G  is  bounded  by  n2(4eccp)fc_1  =  n2(4ec)fc_1  =  o(l)  if  k  >  A\nn  and  A  is  sufficiently  large. 

4  Finding  Paths  Between  Nodes 

Thus  far,  we  have  considered  the  existence  of  paths  between  nodes  in  random  line-of-sight  networks.  In 
terms  of  the  motivating  applications,  it  is  also  interesting  to  consider  the  algorithmic  problem  faced  by  a 


pair  of  nodes  s  and  t  trying  to  construct  a  path  between  them  in  such  a  network.  We  consider  a  decentralized 
model  in  which  each  node  knows  only  its  own  coordinates  and  those  of  its  neighbors  in  G\  given  the 
coordinates  of  t,  the  node  s  must  pass  a  message  to  t  by  forwarding  it  through  a  sequence  of  intermediate 
nodes.  We  consider  the  standard  goal  in  wireless  ad-hoc  routing:  we  wish  to  construct  an  s-t  path  with  a 
small  number  of  edges,  while  consulting  a  small  number  of  intermediate  nodes  [22]. 

We  show  that  it  is  possible  to  find  good  paths  by  decentralized  means  when  the  placement  probability  p 
is  a  constant  factor  larger  than  the  threshold  for  connectivity. 

Theorem  4.1  Let  p  =  C  In  n/uj  for  a  constant  C  to  be  specified  below.  There  is  a  decentralized  algorithm 
that,  given  s  and  t,  with  high  probability  constructs  an  s-t  path  with  0(d{s,t)/ui  +  Inn)  edges  while 
involving  0(d(s,  t)/ui  +  co  In  n)  nodes  in  the  computation. 

We  note  that  this  bound  is  nearly  optimal,  since  Ll(d(s ,  t) / to)  is  a  simple  lower  bound  on  the  number 
of  edges  and  the  numbers  of  nodes  involved  in  any  s-t  path.  For  example,  if  s  and  t  are  selected  at  random 
(so  that  d(s,  t )  is  linear  in  expectation),  then  given  our  upper  bounds  on  ui  from  Section  2,  both  bounds  in 
Theorem  4.1  arc  0(d(s,  t)/oj),  since  the  other  terms  are  of  asymptotically  lower  order. 

To  begin  the  proof  of  Theorem  4.1,  let  N  =  n2  and  let  Si,  S2,  ■  ■  ■ ,  Sn  be  the  collection  of  all  w  x  w 
sub-squares  obtained  by  choosing  to  consecutive  rows  and  columns.  Let  Gi,  Hi,  i  =  1, 2, . . . ,  N  be  defined 
similarly  to  that  done  in  Section  3.  We  first  observe  the  following. 

Lemma  4.2  (a)  With  high  probability  G\ ,  G'2 , . . . ,  Gjy  are  all  connected,  (b)  With  high  probability  the 
diameter  of  Gi  is  at  most  D  In  n,  i  =  1,  2, . . . ,  N,  where  D  is  some  absolute  constant. 


Proof.  The  proof  of  (a)  is  simple.  G\  is  connected  iff  H 1  is  connected.  If  II 1  is  not  connected  then  then 
there  exist  non-empty  subsets  K  C  Ri,  L  C  C\ ,  \K\  +  \L\  <  to  such  that  K  U  L  induces  a  connected 
component  of  If.  The  probability  that  such  a  pair  exist  is  at  most 


£ 
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2 

<  - 
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2 <k+t<n 


pk+t~  1(2  _  p^k{ui-e)+t{uj-k) 
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-  E 

^  2<k+l<n 

-  £ 
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jclnn  ^ 

' e2C\wn\  k+i 


1  - 


2 kt  \  \ 
u)(k+t) )  J  , 


n 


C/2 


=  0((lnn)2um  c). 


So  if  C  >  3  we  can  inflate  this  latter  probability  estimate  by  n2  /lo  to  account  for  all  of  G\,  G2,  ■  ■  ■ ,  Gn- 
The  proof  of  paid  (b)  is  more  involved,  but  it  is  a  standard  calculation;  see  for  example  Bollobas  and 
Klee  [5],  ■ 


The  next  thing  we  observe  is  that  we  can  now  assume  that  all  arms  of  all  vertices  arc  mighty.  This  is 
again  a  simple  calculation,  similar  to  that  given  for  the  proof  of  (2.2).  This  also  allows  us  to  specify  the 
value  of  C  in  the  expression  p  =  C  In  n/u:  it  should  be  large  enough  for  Lemma  4.2  to  hold  and  for  all 
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arms  of  all  nodes  to  be  mighty.  (In  fact,  as  will  be  clear  from  the  subsequent  discussion,  we  will  need  only 
a  weak  valiant  of  mightiness  in  the  analysis.) 

We  now  describe  the  decentralized  algorithm  to  pass  a  message  from  a  node  s  to  a  node  t  (thereby 
constructing  an  s-t  path).  The  algorithm  consists  of  two  stages.  First,  starting  at  s,  the  message  is  passed 
between  nodes  on  the  row  of  s,  moving  the  “short  way”  around  the  torus  toward  the  column  of  t.  Each  node 
passes  the  message  to  its  farthest  neighbor  on  the  arm  in  the  correct  direction;  since  all  arms  arc  mighty, 
the  message  travels  an  L\ -distance  of  at  least  u/2  in  each  step.  This  process  stops,  at  a  node  u,  when  the 
message  is  about  to  “overshoot”  the  column  of  t.  At  this  point,  the  message  is  then  passed  between  nodes 
in  the  column  of  u,  according  to  the  same  rule.  This  process  stops  when  the  message  is  about  to  overshoot 
the  row  of  t. 

The  second  stage  now  begins,  with  the  message  at  a  node  v  that  belongs  to  a  subset  B  of  size  ujxlu,  such 
that  B  also  contains  t.  The  message  is  now  propagated  by  breadth-first  search  to  all  nodes  within  D  In  n 
steps,  but  only  including  nodes  that  belong  to  the  set  B.  Here  I)  is  the  constant  from  Lemma  4.2.  (Note  that 
by  our  assumption  that  nodes  know  the  coordinates  of  themselves  and  their  neighbors,  a  node  can  determine 
which  subset  of  its  neighbors  lie  in  B  and  hence  should  be  included  in  the  BFS.)  By  Lemma  4.2,  the  node 
t  will  be  reached  by  this  BFS,  since  the  subgraph  of  G  restricted  to  B  is  connected  and  with  appropriately 
short  paths. 

The  bound  on  the  number  of  edges  in  the  resulting  s-t  path  follows  directly  from  the  definition  of  the 
two  stages.  To  bound  the  number  of  nodes  involved  in  the  computation,  we  observe  that  0(d(s,  t)/uj)  nodes 
are  involved  in  the  first  stage,  and  the  second  stage  involves  at  most  the  total  number  of  nodes  in  B,  which 
is  0(pu2)  =  0(uj\nn)  with  high  probability. 

5  Relay  Placement:  An  Approximation  Algorithm 

Finally,  we  discuss  an  approximation  result  for  the  Relay  Placement  problem:  given  a  set  of  nodes  on  a  grid, 
we  would  like  to  add  a  small  number  of  additional  nodes  so  that  the  full  set  becomes  connected.  As  before, 
we  are  given  an  n  x  n  torus  of  points  T.  Let  K  =  ( T,E )  be  the  graph  defined  on  the  points  of  T,  in  which 
we  join  two  points  by  an  edge  if  they  can  see  one  another.  Also,  we  are  given  a  cost  cx  for  each  point  x  £  T, 
and  for  a  set  X  C  T  we  define  c(X)  =  Jfxe  x  cx- 

Let  X  =  {x±,X2,  ■  ■  ■  ■  Xf;  }  be  a  given  set  of  points  in  T.  We  consider  the  problem  of  choosing  a  set  of 
additional  points  Y  =  {y\ .....  ys }  such  that  I\  [X  U  Y]  is  a  connected.  We  call  V  a  Steiner  set  for  X ;  nodes 
placed  at  Y  can  act  as  “relays”  for  an  initial  set  of  terminal  nodes  placed  at  X.  Our  goal  is  to  find  a  Steiner 
set  whose  total  cost  as  small  as  possible. 

This  is  an  instance  of  the  Node-Weighted  Steiner  Tree  problem  in  the  graph  K,  with  X  as  the  set  of 
given  terminals  and  Y  as  the  set  of  additional  Steiner  nodes  whose  total  cost  we  want  to  minimize.  In 
general,  there  is  an  f2(log  n)  hardness  of  approximation  for  this  problem  [16]  (and  this  is  matched  in  [16]  by 
a  corresponding  upper  bound).  However,  the  special  structure  of  the  graph  K  allows  us  to  efficiently  find  a 
Steiner  set  whose  cost  is  within  a  constant  factor  of  minimum.  This  is  the  content  of  the  following  theorem, 
which  we  prove  in  the  remainder  of  the  section. 

Theorem  5.1  There  is  a  polynomial-time  algorithm  that  produces  a  Steiner  set  whose  total  cost  is  within  a 
factor  of  6.2  of  optimal. 

The  crucial  combinatorial  property  of  K  that  we  use  is  captured  by  the  following  definition.  We  say  that 
a  graph  H  is  d-cohesive  if  every  connected  subset  of  H  has  a  spanning  tree  of  maximum  degree  d.  That  is, 
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given  any  connected  subset  S  of  V(H),  we  can  choose  a  set  F  of  edges,  each  with  both  ends  in  S,  such  that 
(S,F)  is  a  tree  of  maximum  degree  d. 

We  note  that  it  is  easy  to  construct  graphs  that  arc  not  d-cohesive  for  any  specified  d;  for  example,  any 
graph  containing  an  induced  K\cj+\  is  not  d-cohesive.  In  fact,  although  it  is  not  crucial  for  our  purposes 
here,  we  note  that  the  cohesiveness  is  a  combinatorial  property  of  G  that  is  almost  entirely  characterized  by 
this  particular-  type  of  obstruction;  if  we  let  k(G)  denote  the  minimum  d  for  which  G  is  d-cohesive  and  we 
let  i~p(G)  denote  the  maximum  t  for  which  G  contains  an  induced  AY/,  then  we  can  prove  the  following. 

Proposition  5.2  ip(G )  <  k(G)  <  <p(G )  +  1. 

Returning  to  the  line-of-sight  graph  K,  a  direct  application  of  Proposition  5.2  implies  that  K  is  5- 
cohesive.  With  somewhat  more  care,  we  can  show 

Lemma  5.3  The  graph  I\  is  d-cohesive. 

Proof.  A  direct  application  of  Proposition  5.2  implies  that  K  is  5-cohesive,  but  we  can  do  better  via  the 
following  argument.  For  each  edge  of  AT,  define  its  length  to  be  the  number  of  rows  or  columns  of  T  that 
separate  its  ends.  Now,  consider  an  arbitrary  connected  subset  S  of  K,  and  let  (5,  F )  be  a  spanning  tree  of 
S  whose  total  edge  length  is  minimum. 

We  claim  that  the  maximum  degree  of  (5,  F)  is  four.  For  suppose  not;  then  some  node  a  G  S  has  degree 
at  least  five,  and  hence  there  are  two  nodes  v.  w  G  .S'  that  lie  on  the  same  arm  of  u,  and  for  which  (u.  v) 
and  (u.  w)  are  both  edges  in  F.  In  other  words,  u,  v,  w  lie  in  the  same  row  or  column  of  T,  in  this  order, 
and  u  and  w  are  close  enough  to  see  one  another.  It  follows  that  (v,  w )  is  also  an  edge  of  K.  But  now 
(S,  F  U  { (v.  w) }  —  { (u,  w) }  is  a  spanning  tree  of  S  whose  total  length  is  strictly  less  than  that  of  (S,  F),  a 
contradiction.  ■ 

We  now  describe  the  approximation  algorithm  and  its  analysis.  We  first  define  weights  on  the  edges  of 
K  as  follows.  First,  we  say  that  the  X-reduced  cost  c{ f  of  a  node  v  is  equal  to  0  if  v  €  X,  and  equal  to  cv 
otherwise.  We  define  cx  (Y)  =  f2y(zY  cy  •  F°r  cac'h  edge  e  =  (v.  w )  of  K,  we  define  its  weight  we  to  be 
max(qf ,  cx ).  For  a  subgraph  A  of  A,  let  w( A)  denote  its  total  edge  weight. 

Now,  let  Y*  be  a  Steiner  set  for  X  of  minimum  cost,  and  let  A*  be  a  Steiner  free  for  X  of  minimum 
total  edge  weight.  (Note  that  the  Steiner  nodes  of  A*  may  be  different  from  Y* .)  The  4-cohesiveness  of 
K  implies  a  corresponding  gap  of  4  between  the  cost  of  the  optimal  Steiner  set  Y*  and  the  weight  of  the 
optimal  Steiner  tree  A*. 

Lemma  5.4  w( A*)  <  4c(Y*). 

A  Steiner  tree  whose  edge  weight  is  within  a  constant  factor  7  <  1.55  of  optimal  can  be  computed  in 
polynomial  time  via  an  algorithm  from  [21].  Let  A'  be  a  Steiner  tree  for  X  computed  using  this  algorithm. 
Let  Y'  be  the  Steiner  nodes  of  A'.  By  charging  the  costs  of  nodes  in  Y'  to  the  weights  of  distinct  incident 
edges  in  A',  we  have 

Lemma  5.5  c(Y')  <  w(A'). 

Finally,  we  use  Y'  as  our  Steiner  set  for  X.  Using  Lemma  5.4  and  Lemma  5.5,  together  with  the 
approximation  guarantee  for  the  edge  weight  of  A',  we  obtain  a  bound  of  47  <  6.2  on  c(Y')  relative  to  the 
optimum  c(Y*): 

c(Y')  <  w(A')  <  jw(A*)  <  47 c(Y*). 
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Appendix 


Proof  of  Proposition  2.1.  Let  AQ  denote  the  number  of  vertices  of  degree  0  <  l  <  k.  Then  observe  first 
that 


Ep Q]  =  n2p(^^p\l  -p)4uJ  1 

2  (1  — i6)lnn  4lu;1 

~  n  ■ - - - •  — — 

2  u  l\ 

Jo  l<k- 2 
1  Afc  l  =  k  -  1 


/ (1  —  7p')lnn\  nse  2c 
y  2(x>  J  ri2(\nn)k 


Thus  the  expected  number  of  vertices  of  degree  less  than  k  is  asymptotically  A/-.  The  rest  of  the  proof  is 
quite  standard.  Let  ,S'/,.  denote  the  set  of  vertices  of  degree  less  than  k  in  G  and  let  X  =  Sf:  \ .  Let  X"  denote 
the  number  of  pairs  of  vertices  v,  w  G  Sf.  such  that  v.  w  arc  within  t\  distance  2tu  of  each  other.  Let  X' 
denote  the  number  of  vertices  in  ,S’/.  which  arc  at  l\  distance  greater  than  2t o  from  any  other  vertex  in  Sf... 
Then 


X'  <  X  <  X'  +  X”. 


Now 

using  our  upper  bound  on  <5.  Thus  X  =  X'  with  high  probability. 

Now  fix  a  positive  integer  t.  Then,  where  ( a)t  =  a(a  —  1)  •  •  •  (a  —  t  +  1),  we  compute 


E  [X"]  <  16c u2nV 


fc-i 


k- 1 


(n2  —  16tu2)p  y  p\i-p) 

\  i= 0 


4a j—i 


<E[(X')t]<  n2p^p*(l-p) 


\Auj—i 


i= 0 


which  implies  that 

lim  E  [(X')t]  =  A 1 

n— kx)  J 

and  so  X'  is  asymptotically  Poisson  with  mean  A/,.,  which  implies  the  lemma.  ■ 
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